Genotype imputation of Metabochip SNPs using a study-specific reference panel of ~4,000 haplotypes in African Americans from the Women's Health Initiative.

نویسندگان

  • Eric Yi Liu
  • Steven Buyske
  • Aaron K Aragaki
  • Ulrike Peters
  • Eric Boerwinkle
  • Chris Carlson
  • Cara Carty
  • Dana C Crawford
  • Jeff Haessler
  • Lucia A Hindorff
  • Loic Le Marchand
  • Teri A Manolio
  • Tara Matise
  • Wei Wang
  • Charles Kooperberg
  • Kari E North
  • Yun Li
چکیده

Genetic imputation has become standard practice in modern genetic studies. However, several important issues have not been adequately addressed including the utility of study-specific reference, performance in admixed populations, and quality for less common (minor allele frequency [MAF] 0.005-0.05) and rare (MAF < 0.005) variants. These issues only recently became addressable with genome-wide association studies (GWAS) follow-up studies using dense genotyping or sequencing in large samples of non-European individuals. In this work, we constructed a study-specific reference panel of 3,924 haplotypes using African Americans in the Women's Health Initiative (WHI) genotyped on both the Metabochip and the Affymetrix 6.0 GWAS platform. We used this reference panel to impute into 6,459 WHI SNP Health Association Resource (SHARe) study subjects with only GWAS genotypes. Our analysis confirmed the imputation quality metric Rsq (estimated r(2) , specific to each SNP) as an effective post-imputation filter. We recommend different Rsq thresholds for different MAF categories such that the average (across SNPs) Rsq is above the desired dosage r(2) (squared Pearson correlation between imputed and experimental genotypes). With a desired dosage r(2) of 80%, 99.9% (97.5%, 83.6%, 52.0%, 20.5%) of SNPs with MAF > 0.05 (0.03-0.05, 0.01-0.03, 0.005-0.01, and 0.001-0.005) passed the post-imputation filter. The average dosage r(2) for these SNPs is 94.7%, 92.1%, 89.0%, 83.1%, and 79.7%, respectively. These results suggest that for African Americans imputation of Metabochip SNPs from GWAS data, including low frequency SNPs with MAF 0.005-0.05, is feasible and worthwhile for power increase in downstream association analysis provided a sizable reference panel is available.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Assessment of Genotype Imputation Performance Using 1000 Genomes in African American Studies

Genotype imputation, used in genome-wide association studies to expand coverage of single nucleotide polymorphisms (SNPs), has performed poorly in African Americans compared to less admixed populations. Overall, imputation has typically relied on HapMap reference haplotype panels from Africans (YRI), European Americans (CEU), and Asians (CHB/JPT). The 1000 Genomes project offers a wider range o...

متن کامل

Estimation of genotype imputation accuracy using reference populations with varying degrees of relationship and marker density panel

Genotype imputation from low-density to high-density (SNP) chips is an important step before applying genomic selection, because denser chips can provide more reliable genomic predictions. In the current research, the accuracy of genotype imputation from low and moderate-density panels (5K and 50K) to high-density panels in the purebred and crossbred populations was assessed. The simulated popu...

متن کامل

Prospective Associations of Coronary Heart Disease Loci in African Americans Using the MetaboChip: The PAGE Study

BACKGROUND Coronary heart disease (CHD) is a leading cause of morbidity and mortality in African Americans. However, there is a paucity of studies assessing genetic determinants of CHD in African Americans. We examined the association of published variants in CHD loci with incident CHD, attempted to fine map these loci, and characterize novel variants influencing CHD risk in African Americans. ...

متن کامل

Imputation of parent-offspring trios and their effect on accuracy of genomic prediction using Bayesian method

The objective of this study was to evaluate the imputation accuracy of parent-offspring trios under different scenarios. By using simulated datasets, the performance Bayesian LASSO in genomic prediction was also examined. The genome consisted of 5 chromosomes and each chromosome was set as 1 Morgan length. The number of SNPs per chromosome was 10000. One hundred QTLs were randomly distributed a...

متن کامل

Multiple Mental Disorders and Suicidality; Cross-Ethnic Variation among Blacks

Background: For psychiatric disorders, comorbidity is a rule rather than exception. Thus it is particularly important to study additive and multiplicative effects of multiple mental disorders on suicidal behaviors. Objectives: The aim of this study was to investigate the ethnic differences in multiplicative effects of mental disorders on suicidal ideation among Black adults in the United States...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Genetic epidemiology

دوره 36 2  شماره 

صفحات  -

تاریخ انتشار 2012